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METHODS AND APPARATUS FOR GENERATING, INCLUDING AND 
USING INFORMATION RELATING TO ARCHIVED AUDIO/VIDEO DATA 

5 FEELD OF THE INVENTION 

The present invention is directed to the archiving, storage, retrieval 
and distribution of audio, video and/or other types of data. 

10 BACKGROUND OF THE INVENTION 

In the modern world there is an abundance of audio and video material 
in existence with more created daily. Common examples of audio/video material 
include e.g., songs, movies, television, radio broadcasts, etc. More mundane 
1 5 examples of audio and/or video information include images of scanned documents, 
tape recordings of court depositions, audio and/or video recordings of business 
meetings, recordings of communications signals, e.g., recordings such as wiretaps 
made by the government, etc. 

2 o Pre-existing audio/video material, such as old television shows, 

movies, radio recordings, songs, etc. represent a wealth of information that is often 
desirable to maintain and preserve for future generations. Similarly, the audio/video 
information created on a daily basis for entertainment and other purposes also 
represents information that, in many cases, it would be desirable to preserve for long 

2 5 periods of time if not indefinitely. 



Material to be archived can be in any of several analog or digital 
formats and on any of several types of media. There is a need to organize and store 
this material so that the content can be preserved and then searched and retrieved for 
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later use. Later use of archived material can require the material be delivered in its 
original, or close to original, format or a different format. 

Long term storage, e.g., archiving, of video and other information 
presents many problems not the least of which is the introduction of errors into the 
information by the failure of the storage medium itself over time. For example, film 
fades over time making it an unsatisfactory long term storage media for movies and 
other information. 

The advent of computers, the relative stability of modern digital 
storage media, and the ability to correct errors introduced by digital storage, e.g., 
through the use of error correction codes and other error correction techniques, makes 
digital storage of information much more attractive than many older storage 
techniques. The ability to index, easily access, and accurately reproduce data which 
5 is stored digitally makes digital archiving of data, including audio/video information 
an ever more attractive option. Analog audio and video content needs to be digitized 
if digital archiving of the material is to be performed. This may be achieved by 
performing digital sampling operations on the analog audio/video information to be 
archived. 

0 

Significant decreases in the cost of digital storage media have further 
increased the attractiveness of digital storage as an archiving technique. However, 
given the vast amounts of information that often need to be archived, in many cases 
the storage of information and the cost of transmitting such information in an 
5 uncompressed state still remains impractical. Accordingly, if information is to be 
archived in a digital form, in many cases there will be a need to use some form of 
data compression prior to storing the data in the archive. 
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Furthermore, audio and video content normally needs to be 
compressed and formatted if one wishes to use modern digital transmission 
techniques to deliver archived information to an end user. Utilized transmission 
techniques may include use of the Internet (using Internet Protocol), 
5 telecommunication channels such as switched networks, satellites and/ or wireless 
communications system. In each of these cases, very specific compression and 
formatting steps are often required for data delivery. 

While proprietary data compression algorithms may be favored by 
1 0 some companies offering archiving services, the use of proprietary compression 

algorithms can present problems in regard to future retrieval of the data, hi the case 
of long term storage of data, the company which created the archive on behalf of a 
client may be out of business by the time the archived information is to be accessed. 
This can make it difficult to obtain the required decompression programs in a form 
1 5 which can be run on computers which are in use at the time data retrieval and thus 
decompression is to be performed. 

While volume presents one problem with the archiving of audio/video 
information, the variety of input and output formats which need to be supported 

2 0 create other problems. In many cases, the archived information will have to be output 
in multiple formats some of which may not even exist at the time the digital archive is 
initially created. For many users of an archive system, e.g., video archiver's, the 
ability to output archived information to different formats with little or minimum loss 
in image quality can be important. However, for other archive users, e.g., people 

2 5 archiving video corresponding to business meetings or court proceedings, a much 
higher degree of image degradation, corresponding to higher compression and thus 
reduced storage and transmission costs, may be acceptable. 
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The ability to search and access an archive is important particularly as 
the size of an archive increases. Various known data storage systems index data as it 
is added to an archive but do not include support for indexing additional aspects of 
the archived information after the archive is created. Such indexing can significantly 
facilitate the subsequent retrieval of data. Unfortunately, in the case of long term data 
storage, the information included in an index created at the time of archiving often 
proves inadequate for previously unforeseen search and retrieval needs. 

From the above discussion, it should be apparent that digitized and 
compressed data can be well suited for archiving, as it can provide reliable long term 
storage and allows for electronic cataloguing and indexing in addition to computer- 
assisted functions such as search and retrieval. 

Known systems for digitizing audio and video content and producing 
digital files therefrom normally put the material into the specific end-user format 
required for a particular application, e.g., radio or television broadcast. 
Unfortunately, such application formats tend to be optimized for a particular purpose, 
e.g., to produce a data stream limited by the capacity of a particular communications 
channel, and not for the purpose of subsequent conversion from the initially generated 
format to other formats. Accordingly, from an archiving perspective, storage of data 
in an end user format designed for a particular application is often unsuitable since 
conversion from the first end user format to another end user format will, in many 
cases, result in image and/or audio quality significantly below what could have been 
obtained from the original information source. 

With modern systems, the solution has been to preserve the original 
source of the audio/video data and to re-submit the source for digitization if a new 
digital format is required for another application or at a later time. 
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Thus, in known systems, to generate data in another format at a later 
time requires that the original source of the audio/video content, e.g., the analog film, 
be preserved for subsequent use or that the data in the original end user format be 
converted to another end user format which can negatively impact ultimate image 
quality. 

Since storage of information in non-digital form has the problems 
discussed and, in addition, since re-digitizing of analog source material is an 
inefficient use of resources since previously performed work is repeated each time a 
new application format needs to be supported, it is desirable that any digital archiving 
technique support multiple output formats. 

In view of the above discussion, it is apparent that there is a need for 
improved digital archiving methods and apparatus. 

SUMMARY OF THE INVENTION 

The present invention is directed to methods and apparatus for the 
archival storage and retrieval of audio/video information. In accordance with the 
present invention audio and/or image, e.g., video, information is digitized, 
compressed and stored in an intermediate archive format (IAF), which preserves the 
content at a high-enough quality for subsequent retrieval and conversion into various 
formats required at the time of use or distribution. For purposes of the present 
application images include, e.g., still images, pictures, frames, or portions thereof. 

In accordance with the invention, a single capture operation is 
performed with ancillary metadata being generated and added to facilitate subsequent 
searching, indexing and format conversion. 
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Captured data content is catalogued and indexed at or subsequent to 
the creation of an IAF file that includes the archived, e.g., encoded, information. The 
IAF includes a family of audio-video digital encoding formats based on public 
standards, e.g., MPEG (MPEG-1, MPEG-2, MPEG-4, etc.), JPEG, DV, etc. For each 
5 encoding format multiple encoding profiles and/or levels may be supported with 
different profiles/levels corresponding to different image and/or audio file quality 
levels. 

The encoding format used in any particular application is determined 
from information provided by the archive system user, data received with the 
information to be archived, and/or information generated by analyzing the 
information to be archived. Pre-existing format information and input image quality 
information are well suited for determination from automated analysis of input data. 
Content information may also be determined by an automated analysis operation prior 
to encoding. Information supplied by the system user, or generated automatically by 
analyzing the input information, which is used in selecting an encoding format and/or 
encoding quality level, comprises such things as: quality information, intended use 
information, image source information, and data storage limitation information. In 
some embodiments information used to make encoding format/quality decisions is 
obtained by querying the user in an interactive manner. 

Quality information may indicate the quality of the source data and/or 
the desired minimum output quality level of the achieved data after being retrieved 
and/or transcoded. Intended use information may indicate, e.g., an archiving use, a 
2 5 distribution use such as digital TV distribution by cable, satellite, video-on-demand, 
CCTV, etc, or some other use such as medical application, video conference, 
surveillance, a drafting program, IP browser use, wireless delivery, e.g., radio 
broadcast, etc. Image source information may indicate the media on which the 
information to be archived was previously stored and/or a format in which the 



15 
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information to be archived was previously stored. Examples of media previously 
used to store information include digital audio tape, optical platter, laser disk, CD, 
DVD, analog VHS tape, floppy disk, etc. Examples of formats in which the 
information was previously stored include, e.g., VHS, motion film, NTSC, DV, etc. 

5 

For each encoding format which may be used, multiple encoding 
quality levels may be supported. For example, with regard to at least some encoding 
formats a lossless or near-lossless encoding quality level is supported, a contribution 
quality level is supported and a distribution quality level is supported. At the near- 
1 o lossless quality level, the quality after decoding encoded data will differ from the 
input by not more than a small pre-selected amount. The contribution quality level 
provides a level of quality sufficient for production or post-production processing 
while distribution quality level corresponds to the quality level at which information 
is to be distributed to end users. 

15 

Prior to encoding, the particular encoding scheme is selected in 
accordance with the present invention to optimize a tradeoff between storage 
constraints if any are specified and end use quality requirements. The encoding 
quality level is also selected. 

20 

In order to use storage space efficiently, in various embodiments the 
encoding quality level is selected to be an encoding quality level which will preserve 
the image at a level of quality lower than the input level of quality but at least as high 
as the minimum level of image quality at which the image is to be preserved. 

25 

In other cases where a storage constraint is specified, after selection of 
an encoding format the quality level will be selected to be the highest quality level 
possible given the data storage constraint. 
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In cases where preservation is the goal, the encoding format and 
encoding quality level are frequently selected to insure that the input image and/or 
audio quality is adequately preserved. In fact, where image enhancement operations 
are performed as part of an image processing operation prior to encoding, a stored 
5 image may be stored at a quality level which is better than the input quality level. 

Prior to encoding, the data to be archived may be analyzed to 
determine such things as coding complexity and to create true motion vectors which 
indicate actual differences between images in a sequence of images to be encoded. 

1 0 The true motion vectors may differ from those created by the selected encoding 
scheme. Encoding complexity information, true motion vectors and other 
information are included with the encoded information generated by encoding the 
information to be archived. The encoding complexity information is used, in various 
embodiments, to select encoded images and/or transcoded images to be previewed. 

15 In such embodiments images with high levels of coding complexity are selected for 
preview purposes since they give some indication of the worst case encoding result. 
Encoding complexity information, content information, true motion vectors and other 
information not normally encoded in a set of encoded information but which are 
included in an IAF file in accordance with the invention are used in various 

2 0 embodiments to suggest transcoder output formats. The suggested transcoder output 
formats may be presented as a list to a system user from which the user may make a 
format selection. 

The method of the present invention and the use of an intermediate file 
2 5 format eliminates the need for repeated capture of the same material for transfer to a 
multiplicity of end-user formats. It also eliminates the need to store the source 
material and/or copies of the material in a plurality of application formats. This is 
made possible by the use of an intermediate coded format that is amenable to 
indexing and transcoding, i.e., conversion to a different coded format. 
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Since the archiving method of the present invention is based on a 
single capture of source material regardless of the number of different output formats 
to be supported, the time and cost of this labor-intensive step is reduced as compared 
5 to systems which include a separate capture operation for each supported output 
format. 

In accordance with the present invention indexing and the creation of 
metadata can be decoupled from the operation of encoding audio/video information to 

10 be archived. In accordance with the present invention, subsequent to encoding, the 
encoded material can, and in various embodiments is, catalogued and indexed to 
create a database of metadata. Indexing can be performed multiple times after 
encoding, e.g., three or more times. The indexing process involves fully or partially 
decoding encoded data to be indexed and performing an indexing operation thereon to 

1 5 generate metadata in the form of indexing information. The metadata includes 
various descriptive information about the contents, format, etc., of the data being 
archived. Generated metadata is incorporated into the corresponding IAF file. 

Each IAF file includes the metadata information in the file in a manner 
2 0 which allows the metadata to be used for the efficient computer-assisted search and 
retrieval of archived information also included in the IAF file. In addition to being 
stored in the IAF files, the metadata is used to create indexes into the IAF files to 
facilitate fast access to the archived material in the IAF. The generated indexes 
represent additional metadata which may, and in various embodiments is, 
2 5 incorporated into the relevant IAF files. The indexing metadata associated with each 
archive file is also used to create a global index for an archive including multiple IAF 
files. Thus, in at least some embodiments, the indexing metadata stored in individual 
IAF files is duplicated, e.g., copied, to a global archive database which can be used to 
identify IAF files to be accessed in response to user retrieval requests and/or searches. 
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Since, in various embodiments, indexing is done after encoding of the 
information to be archived, the indexing and creation of metadata need not be 
performed at the rate of encoding. Thus, in at least some embodiments indexing is 
5 performed at a rate which is considerable lower the rate at which data is created, e.g., 
at less than half the data creation rate. 

Since indexing can be performed independently of encoding, if 
desired, indexing and metadata creation can be performed using a separate system 
1 0 dedicated to the indexing process. Notably, at any time after encoding of the archived 
audio/video information the metadata and generated indexes can be updated as the 
need arises or as the desirability of additional search fields becomes apparent. Newly 
generated metadata can be added, e.g., appended, to an IAF file without modifying 
the existing metadata in the file or the encoded archived information itself. 

15 

Information in an archive of IAF files created in accordance with the 
invention can be searched, retrieved, and delivered to the end-user in any of a wide 
range of user specified formats. Multiple output formats are supported by the use of 
transcoding, e.g., re-encoding the archived information into the desired output 
2 0 formats as required. In this manner, retrieved information can be delivered as a low- 
resolution, low-bandwidth stream suitable for browsing, or as any other desired 
format with the quality and resolution being dictated by the end-user application at a 
time subsequent to the creation of the archive. 

2 5 Various additional features and advantages of the present invention 

will be apparent from the detailed description which follows. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 illustrates an exemplary method for generating an archive of 
audio and/or video information in accordance with the present invention. 

5 

• Fig. 2 illustrates an exemplary method of the present invention for 
retrieving and distributing data stored in an archive of the type generated by the 
method of Fig. 1. 

1 o Fig. 3 illustrates an audio/video archiving and retrieval system 

implemented in accordance with one embodiment of the present invention. 

Fig. 4 illustrates a control module suitable for use in the system 
illustrated in Fig. 3. 

15 

Fig. 5 illustrates a capture and compression module suitable for use in 
the system illustrated in Fig. 3. 

Fig. 6 illustrates a transcoder suitable for use in the system illustrated 

2 0 in Fig. 3. 

Fig. 7 illustrates an exemplary intermediate archive format (IAF) file 
generated in accordance with the present invention. 

2 5 Fig. 8 illustrates an exemplary archive including multiple IAF files. 



Fig. 9 illustrates an audio/video archiving and retrieval system 
implemented in accordance with another embodiment of the present invention. 
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DETAILED DESCRIPTION 

As discussed above, the present invention is directed to methods and 
apparatus for the archiving, storage, retrieval and distribution of audio, video and/or 
other types of data. 

Fig. 1 is a flow chart 100 illustrating the steps of an archive generation 
method of the present invention. The method may be implemented using systems 
such as those illustrated in Figs 3 and 9 which will be discussed in detail below. 

The archive generation method begins in step 102 wherein the archive 
generation process is started. Then in step 104 user input is received from an archive 
system user. The user input indicates, e.g., one or more application formats to be 
supported, the minimum output resolution to be supported, the user's desire to 
minimize loss in audio or image quality, the user's preference for a particular level of 
data compression and/or user specified data constraints. 

Next, in step 106, the information, e.g., audio and/or video information 
to be archived is received. The information may be digital information 101 and/or 
analog information 103. 

From step 106 operation proceeds to step 107 wherein an encoding 
format and encoding quality level to be used is determined as a function of various 
input information and/or information generated by analyzing the received information 
to be archived. The encoding format is selected from a family of supported encoding 
formats. 



The IAF is a family of standard digital audio/video formats and 
associated encoding profiles and/or levels, such as MPEG, JPEG, DV, and/or others 
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with one or more quality levels being supported for various formats. The IAF 
supported formats allow metadata to be incorporated with the encoded A/V data, e.g., 
as auxiliary data, without interfering with the ability of a decoder which supports the 
standard encoding format to decode at least the encoded A/V data included in the 
5 created file. The encoding technique and related encoding parameters used to 

generate an IAF file may be selected to be of sufficient quality, resolution, frame rate, 
etc., to ensure that any downstream transcoding or reformatting will yield 
substantially the same result had the original material been coded directly to that end 
format. In one embodiment, in order to achieve this, the archive encoding format 

1 0 selected for any given archiving operation is chosen to be the best of, or better than, 
any anticipated downstream format requested or required by the archiving system 
user. If preservation is a requirement, then the encoding format may be one of 
digitization with no compression, lossless compression, or near-lossless compression. 
In other cases, the encoding format used to generate an IAF file will be selected from 

15 a group of supported IAF encoding formats to provide the best tradeoff among 

criteria such as the degree of visual loss which is acceptable, storage requirements, 
anticipated use, and other user preferences or requests. 

The encoding format used in any particular application is determined 
2 0 from information provided by the archive system user, data received with the 
information to be archived, and/or information generated by analyzing the 
information to be archived. Pre-existing format information and input image quality 
information are well suited for determination from automated analysis of input data. 
Content information may also be determined by an automated analysis operation prior 
25 to encoding. Information supplied by the system user, or generated automatically by 
analyzing the input information, which is used in selecting an encoding format and/or 
encoding quality level, comprises such things as: quality information, intended use 
information, image source information, and data storage limitation information. In 
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some embodiments information used to make encoding format/quality decisions is 
obtained by querying the user in an interactive manner. 

Quality information may indicate the quality of the source data and/or 
the desired minimum output quality level of the achieved data after being retrieved 
and/or transcoded. Intended use information may indicate, e.g., an archiving use, a 
distribution use such as digital TV distribution by cable, satellite, video-on-demand, 
CCTV, etc, or some other use such as medical application, video conference, 
surveillance, a drafting program, IP browser use, wireless delivery, e.g., radio 
broadcast, etc. Image source information may indicate the media on which the 
information to be archived was previously stored and/or a format in which the 
information to be archived was previously stored. Examples of media previously 
used to store information include digital audio tape, optical platter, laser disk, CD, 
DVD, analog VHS tape, floppy disk, etc. Examples of formats in which the 
information was previously stored include, e.g., VHS, motion film, NTSC, DV, etc. 

As mentioned above, for each encoding format which may be used, 
multiple encoding quality levels may be supported. For example, with regard to at 
least some encoding formats a lossless or near-lossless encoding quality level is 
supported, a contribution quality level is supported and a distribution quality level is 
supported. At the near-lossless quality level, the quality after decoding encoded data 
will differ from the input by not more than a small pre-selected amount. The 
contribution quality level provides a level of quality sufficient for production or post- 
production processing while distribution quality level corresponds to the quality level 
at which information is to be distributed to end users. 

In order to use storage space efficiently, in various embodiments the 
encoding quality level is selected to be an encoding quality level which will preserve 
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the image at a level of quality lower than the input level of quality but at least as high 
as the minimum level of image quality at which the image is to be preserved. 

In other cases where a storage constraint is specified, after selection of 
5 an encoding format the quality level will be selected to be the highest quality level 
possible given the data storage constraint. 

In cases where preservation is the goal, the encoding format and 
encoding quality level are frequently selected to insure that the input image and/or 
1 0 audio quality is preserved to the extent possible. In fact, where image enhancement 
operations are performed as part an image processing operation prior to encoding, a 
stored image may be stored at a quality level which is better than the input quality 
level. 

1 5 One of the features of the present invention is the automated selection 

of a video/audio encoding format and associated parameters suitable for an indicated 
user or application. Alternatively, the system can suggest formats/encoding levels to 
a system user for their review and approval. 

2 o Encoding format/level selection can be done base on an analysis of 

needs and features, or as a look-up in predefined tables, or as a learning process. The 
transcoder, discussed below, may include similar functionality for making 
transcoding format/quality level selections and/or suggestions. 

25 In the exemplary capture and compression module discussed below, an 

encoding format and/or encoding parameters appropriate for the best of a number of 
anticipated applications, subject to a minimum threshold of high quality suitable for 
an archive may automatically be determined. 
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Some exemplary pairings between use or application (on the left) as 
specified by received input or data analysis and the selected or suggested video 
encoding format/level (on the right) are: 

5 o Digital TV distribution use --> MPEG-2 MPfSML format/level 

o Digital TV production use -» MPEG-2 422P@ML 
o High definition TV -> a high definition mode of MPEG-2 
o Medical applications -> lossless or near-loss encoding 
o Head-and-shoulders, videoconference, surveillance, etc, -> MPEG-1 or 
10 H.263 

o Browsing delivery by transcoder -» a browsing format such as Windows 

Media, Real, or QuickTime with bitrate set according to user's capabilities 

and bandwidth 
o Wireless delivery -> MPEG-4 

15 

Information on previous encoding formats and/or data storage medium 
used to store the information being processed may also be considered and used when 
making an encoding format selection in accordance with the present invention. 

2 0 As mentioned above, analysis of the information to be encoded may be 

performed as part of the encoding format selection process. Information generated as 
part of the analysis operation may include encoding complexity information 
expressed in terms of a number of different levels, motion estimates between frames, 
format and/or content analysis. Some or all of this information may be used as 

2 5 metadata which can be added, e.g., tagged, to the encoded data created as part of the 

encoding process. 

A capture and compression operation is performed on the received 
information to be archived in step 108. The capture and compression operation 108 

3 o involves an A/D conversion in the case of analog input. Digital input or digital data 

produced by an A/D conversion operation is then followed by an encoding process 
using the encoding format/level determined in step 107. As part of the encoding 
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process, the digital data is encoded according to a format and to a quality level 
determined as a function of various input information and/or information generated 
by analyzing the data to be archived. The encoding operation produces encoded 
digital data 109 which is in an IAF encoding format. 

5 

Optional preview step 108* can be used to preview images represented 
by the encoded digital data 109. Rather than preview all images or randomly select 
images for preview, in one embodiment images which are identified in the above 
discussed analysis operation, as being complex from an encoding perspective, e.g., as 
1 0 indicated by the images being assigned to the top encoding complexity level, are used 
for preview purposes. In this manner the result of the most complex images to 
encode, which are likely to result in the worst image quality, can be identified and 
previewed. If the preview in step 108' indicates an unacceptable image quality a user 
can select a different encoding format to be used for archiving purposes. 

15 

In addition to generated encoded data 109 in the selected IAF 
encoding format, the capture and compression operation produces metadata 1 1 1 
which may includes, e.g., encoding complexity information (generates as part of the 
encoding operation or the analysis performed to select the encoding format), true 

2 0 motion vectors (which differ from motion vectors created according to the selected 
encoding format), archive creation date/time information, encoding format 
information and/or various input, e.g., descriptive information and/or encoding 
parameters, received from the user. After encoding, the encoded data 109 is supplied 
to an analysis step 1 10 wherein the information is processed and analyzed to generate 

2 5 indexing information, e.g., additional metadata. Processing may include decoding of 
the encoded image data while the analyzing may involve, e.g., examining the decoded 
image data to determine pictorial content. The generated indexing information can be 
used to facilitate subsequent accessing of the encoded data 109 and computerized 
searching operations. The metadata generated in step 1 10 is supplied to an IAF file 
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generation step 1 12 which also receives the metadata 1 1 1 and encoded data 109 
output by step 108. 

In step 1 12, the metadata from steps 108 and 1 10 and the IAF encoded 
5 data 1 09 is processed to generate an IAF file 1 1 3 . 

The LAP file 1 13 includes the compressed audio/video bitstream plus 
ancillary metadata that describes, tags, or otherwise specially marks the bitstream or 
bitstreams which, along with the metadata, are multiplexed into an LAP file. 

10 

In accordance with various embodiments of the invention, the IAF file 
1 13 has the property that it can be accessed as a self-contained archive for a particular 
piece of audio-video material or as content plus local indexing and metadata 
information that is just one part of a complete archive of a number of stored audio- 
1 5 video elements that are all searchable and retrieved through a global database 
manager connected to the whole archive or some subset thereof. 

Metadata stored in the IAF file 1 13 is also directly editable, spliceable 
and extensible while the A/V content of an IAF file can, in some cases, be modified 
2 0 or appended to. Furthermore, data streams in an IAF file 1 13 and streams can be 

separated or combined, with the result being another compliant IAF file or additional 
entry within an existing IAF file. 

In accordance with the present invention, the metadata and other 
2 5 ancillary information incorporated into an IAF file may be in a proprietary format. 
However, this data is embedded into the bitstream which makes up the LAP file using 
fields normally ignored by a standard decoder. Thus, the IAF file 1 1 3 has the further 
property that this container of audio/video material is decodable by a standard 
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decoder thereby avoiding the risk of loss of access, and hence loss of data, that would 
occur with a proprietary format and associated proprietary software. 

The IAF file is supplied in step 1 14 to an archive storage manager 
5 which is responsible for placing the file in the archive. Then in step 1 1 6 the IAF file 
is stored on the archive media, e.g., for future retrieval. In step 118, after the IAF file 
has been stored, the archive generation process stops. 

Fig. 2 illustrates a flow chart 200 showing the steps of a method of 

1 o retrieving and distributing data stored in an archive in accordance with the present 

invention. As illustrated, the retrieval and distribution method begins in step 202, 
e.g., with an archive system such as the system illustrated in Fig. 3, being readied for 
the retrieval process. From step 202 operation proceeds to step 204 wherein user 
input specifying data to be retrieved, and input indicating the desired output format, is 
1 5 received. Data retrieval information may be obtained from a user as part of an 
archival browsing process where the user previews portions of archived data and 
selects data to be retrieved or by the user providing some information, e.g., file name, 
content description, or other indexed information which can be used to identify 
archived data for retrieval purposes. 

20 

From step 204 operation proceeds to step 206 wherein a search of the 
archive is initiated, e.g., by an archive management module, using the information 
obtained from the user. In step 208 the information to be retrieved is located as the 
result of the search. Using the location information, in step 210, e.g., the user 

2 5 specified information, in the form of IAF encoded data 2 1 1 , is retrieved from the 

archive, hi optional step 216, the system user is provided the opportunity to preview 
the retrieved data prior to, or in parallel with, the retrieved encoded data being subject 
to transcoding in step 212. 
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In transcoding step 212, the retrieved IAF format data is converted to 
the output specified by the user. Prior to the user specifying the output format the 
transcoder may or trancoder control circuitry may suggest to the user one or more 
recommended output formats. The output formats may be based on end use and/or 
5 distribution channel information. The formats and/or quality levels to be 

recommended may be selected in the same manner as discussed above in regard to 
selecting an appropriate IAF encoding format. Transcoder suggested data output 
formats may be displayed to the user in the form of a list from which the user can 
select one or more desired output formats. 

10 

In most cases transcoding to the user specified output format(s) will 
involve decoding the retrieved data 21 1, in substep 213, and recoding the data to the 
format specified by the user, as in substep 214. In cases where the IAF format is the 
same as, or similar to, the format specified by the user decoding of the data may not 
15 be necessary to place it into the format specified by the user. The result of 

transcoding step 212 is application format encoded data, i.e., data coded to the format 
specified by the user, which in step 218 is supplied to a data delivery system, e.g., 
cable system, television broadcaster, Internet service provider, etc. 

2 o With the data selected by the user having been retrieved and delivered 

in the user specified format, the information retrieval process halts in step 220. 

Notably, while the retrieval process of the present invention has been 
described with the user selecting a single output format, in some embodiments a user 
2 5 may specify multiple output formats for a single set of retrieved data. In such 

embodiments, in transcoding substep 214 the retrieved decoded data is encoded into 
each of the formats specified by the user. This results in the same retrieved data 
being converted into multiple output formats without the need for multiple capture 
operations. Alternatively, to support multiple output formats the retrieval method 200 
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may be performed repeatedly with the same data being retrieved each time but a 
different output encoding format being selected by the user each time. 

Fig. 3 illustrates an audio and/or video archiving and retrieval system 
(AVARS) 300 of the present invention suitable for use in performing the methods 
illustrated in Figs. 1 and 2. The AVARS 300 is illustrated in Fig. 3 coupled to a 
delivery system 324. The delivery system 324 may be anyone of a plurality of known 
communications systems capable of delivering digital and/or analog A/V signals 
output by the AVARS 300. 

The AVARS 300 comprises a control module 320, a capture and 
compression module 302, IAF wrapper module 304, archive storage manager module 
306, a first preview module 310, an analysis and indexing module 314, search and 
retrieval module 314, transcoder 316, a second preview module 322, and a user 
input/output device 318. Each of the modules used in the AVARS 300 may be 
implemented using hardware, software, or a combination of hardware and software. 

The control module 320 is responsible for controlling general 
operation of the AVARS 300 in response to input received from a system user and 
data to be processed. In particular, the control module 320 is responsible for 
selecting, based on user supplied information, the particular encoding scheme to be 
used for encoding received input, out of a plurality of supported IAF supported 
encoding formats. The control module 320 is also responsible for controlling the 
indexing of data as part of the archiving process, data retrieval, and transcoding 
operations, as well as other processing operations. 

Fig. 4 illustrates an exemplary control module 320 suitable for use as 
the control of the AVARS 300. As illustrated the control module 320 includes a 
central processing unit 402, memory 404 and an input/output (I/O) interface 406 
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which are coupled together by bus 408. The I/O interface couples the control 
module's various components to external modules and devices such as the user 
input/output device 316, capture and compression module 302, analysis and indexing 
module 314, search and retrieval module 314, first and second preview modules 310, 
5 322 and transcoder 316. 

Control module operation is controlled by CPU 402 under direction of 
various routines stored in the memory 404. The stored routines include a user 
interface routine 410, encoding format selection routine 412 and learning routine 414. 

10 In addition to various routines, the memory 404 includes information 416 used by the 
routines 410, 412, 414. The stored information 416 includes a database of supported 
encoding formats which lists both IAF encoding formats and formats which may be 
generated by transcoding, various encoding parameters, and information relating user 
input with one or more supported IAF encoding formats and/or transcoder encoding 

1 5 formats, e.g., data output formats. 

The control module 320 has an intelligent user interface 410 and 
learning routine 414, which are used to choose the specific type of compression to be 
performed as part of IAF encoding and parameters needed to control the selected 

2 0 encoding circuitry. The control module 320 ensures that the type of compression 
used, if any, maintains sufficient quality of the archived information that the 
anticipated end-user application(s) specified by the user prior to, or as part of the 
archiving process, can be supported using the archived information. The single 
capture technique of the present invention made possible by the control module's 

2 5 selection of a suitable intermediate file encoding format, avoids the need for 

additional access to original source material thereby avoiding the added cost, time, 
and wear on that material. The intelligent user interface which can prompt the user for 
information and receive information via user input /output device 318 can be as 
simple as the user directly selecting format and parameters from a list, or a more 
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sophisticated question and answer approach, or an automatic selection based on 
anticipated end-user application. In some embodiments the control module uses a 
database of pre-specified formats and parameters to determine the IAF encoding 
format while in other embodiments it learns from user input, e.g., encoding format 
5 and parameter selections, and subsequently builds and refines such a database: In 
various embodiments the user interface queries for specific parameters, for 
anticipated use, and/or storage availability. In other embodiments, there is no query 
at all, in which case parameters are preset or input from an automation script allowing 
unattended capture and compression of information to be archived. 

10 

The control module 320 is where the intelligent user interfaces and 
algorithms for sophisticated control of the individual modules 302, 304, 306, 314, and 
3 1 5 reside. There may be one or more copies of the control module as required by the 
specifics of the system implementation. For example, if the capture module 302 and 
1 5 transcoder 3 1 6 are independent of one another or in different locations they can each 
include their own control module 320. In the control module 320 there are features 
common to multiple modules and features specific to each module as applicable. The 
control module: 

(a) Intelligently controls the other modules according to user input or an 

2 o automation script and does things such as choose modes, set parameters, make 

decisions, steer data, etc. 

(b) Collects information and statistics from the other modules and process that 
information along with the user or automated input to further control the 
modules and to report back on their performance and the performance of the 

2 5 system as a whole. 

(c) Analyzes the existing metadata, permits the input of additional metadata as 
well as information used for making encoding format/level suggestions and 
decisions, and can output new metadata to be added to the archive record. 

(d) Performs additional operations depending on the mode of operation. 
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Two supported modes of operation are a user-driven mode of operation 
and an automated mode of operation. In the user-driven mode, the control module 
intelligently interacts with the user in one or more of the following ways: 



5 (a) The user selects modes and parameters directly. 

(b) The user is queried for the specific modes and parameters. 

(c) The user is queried about the content and its intended use, then appropriate 
modes and parameters are determined. 

(d) The user selects from a collection of predefined profiles. 

1 0 (e) The user is queried and the appropriate predefined profile is determined, 

(f) The control module leams from the user and adapts accordingly. 



The automation mode is driven by predefined profiles and/or a process 
of learning and adjustment as specified by the learning routine 414. 

15 

The capture and compression module 302 is one of the modules which 
operates under the direction and control of control module 320. The module 302 is 
responsible for receiving information, e.g., analog and/or digital A/V information to 
be archived, digitizing it if required, and for encoding the received information using 

2 0 the IAF supported encoding format, e.g., MPEG, JPEG, DV, etc., indicated by the 

control module 320. Data compression may, and in many embodiments is, performed 
as part of the encoding process. Encoding parameters supplied by the control module 
320 are used to control the amount of data compression, if any, that is performed. 
Data compression may be loss-less, low-loss, or of a rate required to meet a control 

2 5 module 320 or user specified data constraint. 



Fig. 5 illustrates an exemplary capture and compression module 302 
suitable for use in the AVARS 300. The module 302 includes a digital capture/digital 
format conversion module 502, scene analysis module 504, image processor 506, 
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multi-format decoder 519, an IAF encoder 510 and an output selector 521. Analog 
audio and video is input to a capture device, e.g., sampling circuit included in module 
502, and converted to digital form. The module 502 also includes an unencoded 
digital A/V input for receiving unencoded digital data. Such digital data is processed 
5 by formatting circuitry included in the module 502 to insure that it is placed in the 
same standardized format expected by downstream components that would be 
produced by the module's processing of analog input. The capture and compression 
module also includes a compressed, e.g., encoded digital A/V input which is coupled 
to the multi-format decoder 519 and an output selector 524. In the case where the 
input is already in a compressed IAF format, the output selector may, under the 
direction of control module 320 select to use the original input data as the compressed 
digital output of the capture and compression module 302. Whether or not the 
compressed digital A/V is in an IAF compliant format, it is supplied to the multi- 
format decoder 519 so, at a minimum, it can be processed for metadata generation 
15 purposes. 

The multi-format decoder 519 includes a plurality of full or partial 
decoder circuits 520, 522 each of which is designed to decode data which has been 
encoded according to a different encoding scheme. For example, decoder 520 may be 

0 an MPEG decoder, decoder 522 a JPEG decoder, with other decoders supporting yet 
other encoding formats. While the received compressed input is supplied to each of 
the decoders 520, 522, the data analysis/decoder control module, working in 
conjunction with control module 320, determines the type of decoding to be used with 
any particular set of input data. With the type of decoding to be performed 

5 determined, the control module 521 enables via a control signal the appropriate one of 
the decoders 520, 522 so that one of the plurality of decoders is used to generate the 
decoded digital A/V data supplied to the scene analysis module 504 when encoded 
input data is being processed. 
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The unencoded digital data output by the digital capture/digital format 
conversion module 302 and multiformat decoder 519 are supplied as input to the 
scene analysis module 504. The analysis module 504 analyzes each frame in digital 
data for the purpose of classification according to such things as scene type or coding 
complexity. Frames are classified according to such things as complexity to encode, 
general characteristics of the frame, potential point of scene change, pictorial content, 
etc. The frames are tagged with the classification and the tag is associated with the 
encoded bitstream using the mechanism of metadata. The classification can be used 
to assist any further processing of the archive material and/or for selecting images to 
be previewed after IAF encoding or transcoding. One particularly important use of 
the generated metadata is by the transcoder in making encoding format output 
suggestions/recommendations and/or format selections. 

In addition to the frame analysis and classification, other types of 
descriptive information about the source content are generated and output by the 
scene analysis module 504 in the form of metadata. 

The results of the analysis performed by module 504 are metadata 
which can, and is, linked to the images to which it relates as part of an encoding 
process. At retrieval time, the metadata can be used to identify and retrieve frames 
with particular characteristics. During the capture and compression stage, some 
metadata is also captured or dynamically created. This metadata, along with the 
metadata created by the analysis module, is linked to and/or embedded into the 
archive file created from the A/V data being processed. 

The metadata and processed digital data output by the scene analysis 
module 502 is supplied to an image processor 506 which can perform one or more 
image filtering and/or other processing operations on frames prior to encoding. A 
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metadata output of the image processor 506 serves as the metadata output of the 
capture and compression module 502. 



The processed digital data, e.g., A/V data output by the image 
5 processor is supplied to an IAF encoder 510 which supports multiple standard 
encoding formats, e.g., MPEG, JPEG which are supported by IAF. To support 
multiple encoding formats the IAF encoder includes a plurality of encoders 530, 532 
with each encoder corresponding to a different encoding format. An IAF encoder 
control module 540 determines, in conjunction with the AVARS control module 320 
0 which encoding scheme will be used for a given set of input data. 

The use of the control modules to control the particular encoding 
scheme provides smart capture and encoding. The AVARS 300 includes intelligence 
in the form of control modules 320, 540 that enables it to choose an appropriate 

5 compression format and associated parameters based on user input or pre-defined 
profiles or learned behavior. The user can directly input specific parameters or there 
can be established a dialog whereby the user is queried about needs and concerns 
such as acceptable quality level, storage availability, and anticipated applications, 
from which the appropriate parameters are automatically determined. In addition or 

0 alternatively, the user can be queried about more abstract concepts such as general 
content description and anticipated use. The appropriate encoding format to be used 
for IAF file creation and parameters will be automatically generated from this dialog. 
There is also the capability to learn and automatically adapt parameters over large 
runs of captures and encodings. 

5 

Once encoded, the encoded data is supplied to output selector 524. 
Output selector 524, under the direction of control module 320, selects a source of 
encoded digital A/V to serve as the encoded digital A/V output by the capture and 
compression module 302. In the case where compressed digital A/V is input to the 
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module 302, the output selector selects between the compressed digital A/V input and 
the output of the IAF encoder 510 to serve as the encoded A/V output of the module 
302. The output selector further includes an A/V preview window to which it supplies 
decoded A/V or encoded A/V depending on the preview window device's needs as 
5 determined by the control module 320. 

In the AVARS 300, the encoded digital data generated by the capture 
and compression module 302 and associated metadata are supplied to the IAF 
wrapper module 304. The IAF wrapper module 304 receives the compressed material 
10 and metadata and reformats as needed into the appropriate storage IAF package 

format to form an IAF file. The IAF wrapper module interacts with the analysis and 
indexing module 314 which is used to perform post encoding analysis. Encoded data 
being placed into IAF files is supplied to the analysis and indexing module prior to 
the IAF file including the encoded data being created. 

15 

The analysis and indexing module 314 receives audio/video material 
in compressed digital form, and uses a decompression, e.g., decoding circuit 319, to 
decompress the received data. It than analyzes and indexes the received data, e.g., 
using indexer 321, to create index information which can be used in searching and 

2 0 accessing the encoded data. The analysis and indexing module return the index 

information to the IAF module which is responsible for formatting it for inclusion in 
an IAF file with the corresponding encoded data. A second copy of the index 
information is passed to a global database manager 307 of the archive storage 
manager for use in creating or updating a global index into the full contents of the 

2 5 archive stored on the archive media 308. Because the indexer receives its input after 
compression, it can, and often does, run asynchronous to the capture and compression 
process. Thus the analysis and indexing module 314 can run faster or slower than the 
compression module 302, process the encoded data once or multiple times, at the 
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current time or at some later time, as automatically determined or specifically 
requested by the user. 

The analysis and indexing module 314 can also retrieve existing 
archived IAF file content directly through the archive storage manager 306 thereby 
allowing indexing or re-indexing to be done at any time, even after the encoded data 
is stored in the archive and is no longer available from the IAF wrapper module 304. 
Re-indexing allows new features of the archived information to be indexed well after 
creation of the initial encoded data file. The analysis and indexing, and further re- 
indexing, can be performed manually by a user coupled to the analysis and indexing 
module 3 14 via user input/output device 318 or can be performed in an automated 
fashion whereby the process is driven by an indexing routine that is predefined or 
adaptable to the task at hand. The analysis and indexing module also includes a 
learning algorithm that permits the indexing routine to adapt over several runs, e.g., 
based on feedback which includes at least some results from one of the indexing runs. 
Another way to set indexing parameters is by stored profiles selected according to 
anticipated end use of the material, where these can be selected manually or by the 
user and then automatically applied. 

Processing by the IAF wrapper module 304, e.g., creating the correct 
wrapper, e.g., placing data in the correct file format, is accomplished by performing a 
formatting operation and adding any headers, additional metadata, and/or ancillary 
data as needed or desired before being output as an IAF file for storage. In addition, 
the wrapper module 304 can send the content to the analysis and indexing module 
314 and receive back the indexing information which it can then link to and/or embed 
into the encoded material. 

The output of the IAF wrapper module 304 is a self-contained archive 
entry, that is, an IAF file. The IAF file is supplied to the archive storage manager 306 
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for transfer to the archive media 308. The LAP file includes the bits needed for 
decoding and play back as well as metadata and indexing information sufficient to 
support later search and retrieval processes or other understanding or analysis of the 
material. 

5 

The LAP file or files generated by the LAP wrapper module are 
received by the archive storage manager 306 for storage on the archive media 308. 
The archive storage manage includes a global database interface module 307 
responsible for creating and maintaining a global database including indexes used for 

1 0 accessing the various LAP files stored on the archive media 308. The archive storage 
manager controls both read and write access to the archive storage media. Within this 
component resides the global database interface module that controls all access to the 
master database in the archive. The global database interface module 307 adds the 
local records specific to each archive entry to the global database and updates any 

1 5 global entries in that global database. Database entries are delivered to requested 

processes for search across archive entries, search within entries, and then subsequent 
retrieval of requested material. 

Fig. 7 illustrates an exemplary LAP file 700 which may be stored on 
2 0 the archive media 308 as part of an archive. The IAF file 700 includes structural 
metadata 700, an LAP file index table 702, indexing information 704, digital rights 
information 706, descriptive metadata 708, time codes 710, captions 712 and encoded 
audio and/or video packets 714. The descriptive metadata 708, time codes 710, and 
captions 712 may be interleaved with the actual data packets 714 as desired or useful. 

25 

An LAF file, in accordance with the present invention, comprises two 
or more of the following elements multiplexed into a single accessible data package. 
• One or more video streams, e.g., organized as a series of packets 
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One or more audio streams, e.g., organized as a series of packets 

Still images 

Graphics 

Captions, e.g., text or closed captions associated with the video 
Timecodes, e.g., as per SMPTE standard, these can be used for 
synchronization and as access points 
Database records (from indexing process) 

Structural metadata, general information about the structure of the file content, 
the structure of the IAF file and the encoding, it includes such things as, e.g., 
title, author, date, topic global description, data encoding type, parameters, 
how many video streams are included in the file, how many audio streams, 
etc. 

Descriptive metadata, e.g., detailed descriptions of the encoded file content 
Index tables to facilitate efficient random access into the file and/or other 
maintenance and control information, the index tables may be implemented 
as, e.g., one or more time code to byte offset lookup tables 
Indexing information which is the local copy of the databse entry including 
information generated by one or more data indexing operations which is to be 
used to support search and retrieval operations 

Digital rights information which includes information for digital rights 
management such as ownership, encryption, distribution rights, etc. 



Each such IAF file, serves as an entry in the archive that is stored on 
the archive media. 

25 

The descriptive metadata, time codes, captions and audio/video stream 
fields of an IAF file may appear several times in the file and may be interleaved with 
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each other if desired and/or as necessary to keep the fields and particularly 
descriptive metadata close to the audio/video content to which it pertains. 

Fig. 8 illustrates an exemplary archive 800 created in accordance with 
5 the present invention. The archive includes one or more global archive index tables 
802, global database records 804 and IAF files 806. The global archive index tables 
802 are tables which include information which enables random access into the 
archive elements such as the global database, IAF files, and may include specific 
byte offsets into those files to access specific data. The global database records 804 
1 0 include copies of the indexing information included in each of the IAF files in the 
archive. 

Once an archive has been created and stored on the archive media 308, 
data may be searched, retrieved from the archive, and delivered in one or more 
1 5 formats, e.g., encoded or unencoded formats, to the delivery system 324. 

In order to support data retrieval and outputting in one or more 
formats, the AVARS 300 includes search and retrieval module 315, transcoder 316 
and preview module 322 in addition to the archive storage manager 306. The search 

2 0 and retrieval module 3 1 5 interacts with a user via control module 320 and user 

input/output device 3 1 8. The search and retrieval module has the ability to access 
and retrieve data from the archive by interacting with the archive storage manage 
which is responsible for overseeing the actual retrieval of data from the archive. In 
response to retrieval command from the search and retrieval module 315, the archive 

2 5 storage manager 306 will deliver retrieved IAF files to the transcoder 3 1 6 for further 
processing. 

The search and retrieval module 315 allows a user to perform searches 
over the archive entries and within entries, delivers low-resolution, low-bandwidth 
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preview audio-video bitstreams for browsing preview to the preview module 322, and 
displays those frames as requested by the user. Images can be selected by a user for 
retrieval and output by a unique frame number assigned to the frame as part of the 
archive index creation process or can be indicated by content. The search and 
5 retrieval module 3 1 5 includes search routines and/or processing circuitry suitable for 
accessing database and/or fde indexes to identify frames based on user specified 
content. The search and retrieval module also includes control circuitry capable of 
initiating the display of identified frames with or without associated audio. 

1 o Data selected to be retrieved and output is read from the archive by the 

archive storage manager 306 and supplied to the transcoder 316 for conversion, if 
required, into the appropriate output format, e.g., one or more output formats 
specified by a system user. Operation in terms of output format control may be by 
way of user input supplied to control module 320 which interacts with the transcoder 
15 316. 

Fig. 6 illustrates an exemplary transcoder 316 which may be used in 
the AVARS 300. As illustrated the transcoder includes a multiformat decoder, image 
processor and encoder circuitry 606 which supports a plurality of encoder formats. 

2 0 The multiformat decoder receives IAF encoded digital data, e.g., one or more 

retrieved IAF files. It performs a full or partial decoding operation on the received 
data under control of signals from the control module. Decoded A/V information is 
supplied to the second preview module 322 for optional viewing of retrieved data and 
to an image processor 604 for processing of the decoded image data prior to re- 
2 5 encoding by one or more of the encoders 606. 



The image processor 604 performs one or more image filtering 
operations or other image processing operations on the decoded image data under the 
direction of the control module 320 prior to the processed image data being supplied 
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to encoder circuitry 606. Encoder circuitry 606 includes a plurality of M encoders 
608, 610 each supporting a different encoding format. An encoder control module 
612 interfaces with the AVARS control module 320 to determine which encoders 
608, 610 should be enabled based on the user specified output format(s) and encoding 
5 parameters such as resolution and output data constraints. When multiple encoded 
output formats are specified, the encoder 608, 610 supporting each of the specified 
formats will be enabled. In the case where the user selected output is similar to the 
retrieved IAF file format only partial decoding and re-encoding may be required to 
convert the retrieved file into the user specified output format. Accordingly, some of 
10 the encoders 608, 610 may be partial as opposed to full video encoders. 



As part of the encoding process performed by the encoders 608, 610, 
encoded data may be decoded as is known in the video encoder art. The decoded 
video data produced from the video data encoded to the output format may be 
1 5 supplied to the second preview module 322 to allow the user to view the retrieved and 
transcoded data as it would appear to an end user receiving the data. 
Encoded digital data generated by the transcoding operation is output by the enabled 
encoders 608, 610 to the delivery system 324. 



2 o Thus, the transcoder 3 1 6 is used for accepting retrieved material in 

IAF format and transcoding, e.g., digitally converting the bitstream format, to any 
explicitly requested delivery or to an intelligently determined format based on 
information provided by a user or set by an automated process. This component is 
capable of changing archived material into any end-user format on demand and in 

2 5 real-time. The transcoder inputs IAF compressed data, decodes according to its 

format, processes the frames as necessary, then re-encodes into one or more output 
compressed formats. The user interacts with this component through an interface 
controlled by control module 320 that can be as simple as the direct selection of 
format and parameters to transcode, or a query mechanism that can choose the format 
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and parameters appropriate for the anticipated use of the material, or can choose the 
format and parameters as preset for particular applications, or can run in an automated 
fashion with minimal user input. There is also a learning algorithm in this component 
so it can adapt over time to the needs of the user and applications. There is also a 
5 mechanism for selecting and previewing the transcoded material before the entire 
content is converted. This mechanism includes a user interface for selecting what 
frames to transcode and display and then interactively modify settings and parameters 
until the desired result is achieved. Additional intelligence allows for the transcoding 
and display to be done not just from the beginning of the material, but from key 
1 0 frames within the material that can be explicitly chosen by the user or automatically 
found according to criteria such as content or complexity of encoding. The tags that 
find these critical frames are those put in by the capture and compression component. 

The output of the transcoder and the final delivery of retrieved 
archived information by the delivery system 324 may be as a compressed bitstream, 
uncompressed digital audio/video, or analog audio/video, plus additional descriptive 
material derived from the metadata stored in the archive. The delivered material can 
be in any of several forms, including but not limited to a digital file written to some 
removable storage, analog material written to tape, digital delivery over a network, 
delivery through other types of transmission, or just analog (or digital) delivery to a 
display device. The delivered material may comprise individual elements within an 
archive or a combination of elements presented in a synchronized way. 

In various embodiments the transcoder 3 1 6 and/or the control module 
2 5 320 has intelligence that enables it to choose an appropriate compression format and 
associated parameters based on user input or pre-defined profiles or learned behavior. 
The user can directly input specific parameters or there can be established a dialog 
whereby the user is queried about needs and concerns such as acceptable quality 
level, deliver requirements, and anticipated applications, from which the appropriate 
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parameters are automatically determined. Alternatively, the user can be queried about 
more abstract concepts such as general content description and anticipated use and the 
appropriate encoding format and parameters will be automatically generated. 

5 Another feature of the transcoder 3 1 6 and/or control module 320 is the 

ability to assist the user in assessing the quality and suitability of a transcoding format 
by allowing the user to preview a sample transcoding of some of the requested 
material via interaction with preview module 322. The representative piece is 
intelligently chosen to show the user how the transcoding will affect specific critical 

1 0 frames of the material. This is accomplished using the characteristic analysis and 

tagging done by the aforementioned smart capture and encoding. The tags allows the 
transcoder 316 to display the results of transcoding those frames with special 
relevance, e.g., hard-to-encode frames will show what the worst frames are expected 
to look like, important scene types, as perhaps specified by the user, can be checked 

1 5 for quality, and so on. Other intelligence in the transcoder 3 1 6 is the ability to allow 
the user to request transcoding and display of material not by specific frame numbers 
but by descriptions of the content. Yet another feature of the transcoder is the use of 
metadata, put in by other elements of the system, that assists the transcoding to be 
better or faster. For example, time is saved if statistics gathered by an earlier encoding 

2 0 process are made available to the transcoder 316 instead of it having to regenerate 
these. Provision is made for the inclusion in the archive and subsequent presentation 
to the transcoder 316 of various types of proprietary information to assist the smart 
transcoder perform its task. 

25 As mentioned above, the archiving and retrival methods of the present 

invention can be implemented using software executed on a computer. Fig. 9 
illustrates a computer based AVARS 900. The AVARS 900 includes a CPU 902 for 
executing routines stored in memory 923, a bus 903, a user input device 910 for 
receiving user input, an output device 912, e.g., display, an archive media 906 and an 
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I/O interface 904. The I/O interface is responsible for receiving input from external 
sources and for outputting information, e.g., signals and/or files, generated by the 
AVARS 900. The I/O interface may receive one or more of the following: metadata, 
an analog A/V signal, encoded digital A/V signal and user input. The I/O interface 
5 904 outputs retrieved and transcoded A/V signals to the delivery system 324. 

The AVARS CPU 902 is responsible for controlling operation of the 
AVARS 900 under direction of routines stored in memory 923. The stored routines 
which include software instructions are illustrated in the form of modules, e.g., 

1 0 operating system module 920, capture and compression module 922, control module 
924, preview module 926, IAF wrapper module 927, analysis & indexing module 
928, search and retrieval module 932 archive storage manage module 930, and 
transcoder module 934. Each of the modules is responsible for control the CPU 902 
to perform functions which are the same as or similar to the functions performed by 

15 the previously discussed modules bearing the same name. In addition to the various 
modules memory 924 may include one or more sets of parameters and/or other 
encoding information. 

In view of the above discussion it will be apparent that the archiving 
20 and retrieval techniques of the present invention can be implemented using any 
desired combination of software and/or hardware to implement the methods of the 
present invention. In addition, numerous variations on the above described methods 
and apparatus are possible without departing from the scope of the invention. For 
example, the encoding formats supported as part of the IAF group of formats may 
2 5 include a wide range of different encoding schemes relating to various applications 
for example, two or more of the following formats may be supported by an archive 
system as part of the group of IAF supported encoding formats. 
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o Uncompressed digital audio/video 
o MPEG-1 MPEG-2, MPEG-4 

o DV, DVCAM, DVCPRO, DVCPRO-50, DCVPRO-HD (DV is the 

standard used for compressing and recording digital video onto 4mm tape 
as used by digital camcorders) 

o H.263, H.263++, H.26L 

o JPEG, JPEG-2000, JPEG-LS, MJPEG, MJPEG-2000 

o MPEG Audio, MP3,AAC,AC-3 

- The following are for transcoder output only - 

o QuickTime 

o Windows Media 

o Real 

As the new formats become standards, additional formats may be 
supported as part of the IAF supported group of encoding formats. 

Furthermore, with regard to the transcoder, user selection and/or 
automatic control may be used for selecting the encoding format and parameters 
(such as bitrate, frame size, frame rate, GOP structure, etc) used for re-encoding data, 
and also for selecting what frames to transcode, e.g., only a segment of the data in an 
IAF file may be transcoded and output. The user or data retrieval module can choose 
for transcoding purposes, frames based on frame's number in an encoded sequence or 
based on a content description, coding complexity, or a measure of moving average 
bitrate, or some other criteria. Furthermore in various embodiments a user can 
interact with the transcoder by, e.g., indicating a range of frames to transcode (e.g. by 
interacting with a preview window and the retrieval module), see the transcoding 
result in the preview window coupled to the transcode, move to another range of 
frames, see that output, and so on. The user interface supported by the retrieval 
module and preview window supports graphical tools for displaying tagged or 
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calculated information along with the decoded IAF file data and for assisting a user 
with selecting test segments for transcoding and viewing the result. As part of the 
trancoding process, a user can interactively change encoding settings/parameters on 
the fly and see the effect of the changes in the preview window if desired. 

5 

As discussed above, analysis and metadata creation is done prior to 
and as part of the IAF data encoding process. For archived data which was not 
tagged prior to archiving, generation of such metadata may be performed at 
transcoding time and then stored in the archive file from which the encoded data, 
1 0 being processed by the transcoder, was retrieved. 
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WHAT IS CLAIMED IS: 

1 1 . A method of operating a system to process image data for storage and retrieval, 

2 the method comprising the steps of: 

3 analyzing said image data to be encoded to determine, for each image 

4 represented by the image data, a level of encoding complexity; 

5 encoding said image data according to a first encoding format to generate 

6 first encoded image data; and 

7 storing with the first encoded image data encoding complexity level 

8 information indicating at least one determined level of encoding complexity associated 

9 with the first encoded image data. 

1 2. The method of claim 1 , further comprising the step of: 

2 retrieving the first encoded image data and encoding complexity level 

3 information from the storage device; 

4 decoding the first encoded image data to generate decoded image data; 

5 determining at least one encoding parameter to be used to re-encode the 

6 decoded image data as a function of the retrieved encoding complexity level information; 

7 and 

8 re-encoding the generated decoded image data using the encoding 

9 parameter determined as a function of the retrieved encoding complexity level 
10 information. 

1 3 . The method of claim 1 , further comprising the steps of: 

2 performing an automated image content analysis operation on at least one 

3 image represented by said image data; and 

4 storing, with the first encoded image data, image content description 

5 information generated by performing said content analysis operation. 



WO 02/087255 



PCT/US02/12423 



4. The method of claim 3, further comprising the steps of: 

receiving image content information from a user of the system; and 
storing, with the first encoded image data and the image content 
description information generated by performing said content analysis operation, the 
image content description information received from the system user. 

5 . The method of claim 3 further comprising the steps of: 

retrieving the first encoded image data and image content description 
information from the storage device; 

decoding the first encoded image data to generate decoded image data; 

deteirnining at least one encoding parameter to be used to re-encode the 
decoded image data as a function of the retrieved image content description information; 
and 

re-encoding the generated decoded image data using the encoding 
parameter determined as a function of the retrieved image content description 
information. 

6. The method of claim 1 , further comprising the step of: 

selecting, based on the determined encoding complexity level information, 
an image represented by the first encoded image data, to be viewed after decoding. 

7. The method of claim 6, further comprising the step of: 

decoding the encoded image data representing the selected image to 
generate decoded image data; and 

displaying the decoded selected image on a display device. 

8 . The method of claim 7, wherein said step of decoding the encoded image data is 
performed as part of said encoding step. 



1 
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1 9. The method of claim 1 , further comprising: 

2 selecting the first encoding format from a plurality of supported encoding 

3 formats, as a function of the determined level of encoding complexity. 

1 10. The method of claim 9, further comprising the step of: 

2 receiving data storage limitation information; and 

3 wherein the step of selecting the first encoding format is also performed as 

4 a function of the received data storage limitation information. 

1 11. The method of claim 1 , further comprising the step of: 

2 retrieving the first encoded image data and encoding complexity level 

3 information; and 

4 using the retrieved encoding complexity level information to identify at 

5 least one data format suitable for distributing an image represented by the first encoded 

6 image data, the identified data format being different from the first encoding format. 

1 12. The method of claim 1 1 , further comprising the step of: 

2 presenting to a user of the system a plurality of data formats suitable for 

3 distributing the image represented by the first encoded image data, the identified data 

4 format being one of said plurality of presented data formats; 

5 receiving from the user information selecting one of the presented data 

6 formats for use in distributing the image; and 

7 converting at least a portion of the first encoded image data from the first 

8 encoding format to the user selected data format. 

1 13. The method of claim 1 , wherein the step of analyzing image data to be encoded to 

2 determine, for each image represented by the image data, a level of encoding complexity 

3 includes: 

4 generating an activity measure for at least one image represented by said 



WO 02/087255 



PCT/US02/12423 



-43- 

1 14. The method of claim 1 , wherein the step of analyzing image data to be encoded to 

2 determine, for each image represented by the image data, a level of encoding complexity 

3 includes: 

4 generating a measure of the luminance variance throughout at least one 

5 image represented by said image data. 

1 15. The method of claim 1 , wherein the step of analyzing image data to be encoded to 

2 determine, for each image represented by the image data, a level of encoding complexity 

3 includes: 

4 generating a measure of the chrominance variance throughout at least one 

5 image represented by said image data. 

1 16. The method of claim 1 , wherein the step of analyzing image data to be encoded to 

2 determine, for each image represented by the image data, a level of encoding complexity 

3 includes: 

4 generating a measure of the motion between at least two complete frames, 

5 each frame corresponding to a different image. 

1 17. The method of claim 1, further comprising: 

2 generating true motion vectors indicating motion between a first image 

3 and a second image; 

4 generating in accordance with the first encoding format, as part of said 

5 step of encoding said image data, a set of motion vectors indicating motion between said 

6 first image and said second image, said set of motion vectors including at least some 

7 motion vectors which are different form said true motion vectors; and 

8 storing the true motion vectors with the first encoded image data which 

9 includes said set of motion vectors. 



1 18. A method of operating a system to process image data for storage and retrieval, 

2 the method comprising the steps of: 
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3 performing an automated scene analysis operation on said image data to 

4 be encoded to generate image content information; 

5 encoding said image data according to a first encoding format to generate 

6 first encoded image data; and 

7 storing the generated image content information in a file with the first 

8 encoded image data. 

1 19. The method of claim 1 8, further comprising: 

2 receiving additional image content information from a user of the system; 

3 storing the additional image content information in said file with the first 

4 encoded image data; 

5 retrieving from storage the stored first encoded image data, said generated 

6 content information, and said additional content information; 

7 selecting a second encoding format to be used for outputting images 

8 represented by said first encoded image data as a function of at least one of said 

9 generated content information and said additional content information; and 

I o re-encoding said first encoded image data to said second encoding format 

II to generate second encoded image data. 

1 20. The method of claim 1 9, wherein the generated image content information 

2 indicates the pictorial content of an image. 

1 21. The method of claim 20, wherein the additional image content information 

2 includes a description of the pictorial content of at least one image. 

1 22. The method of claim 20, wherein the step of re-encoding said first encoded image 

2 data includes: 

3 decoding said first encoded image data to generate decoded image data; 

4 and 
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5 re-encoding said first encoded image data using at least one encoding 

6 parameter generated as a function of said generated image content information. 

1 23 . A system for processing image data for storage and retrieval purposes, the system 

2 comprising: 

3 a scene analysis module for performing scene analysis on said image data 

4 to generate image content information; 

5 means for receiving additional image content information from a user of 

6 the system; 

7 an encoder for encoding said image data according to a first encoding 

8 format to generate first encoded image data; and 

9 a storage device for storing the first encoded image data, said generated 
10 image content information and said additional image content information in a file. 

1 24. The system of claim 23 , further comprising: 

2 means for retrieving from storage the stored first encoded image data, said 

3 generated content information, and said additional content information; 

4 means for selecting, as a function of at least one of said generated content 

5 information and said additional content information, a second encoding format to be used 

6 for outputting images represented by said first encoded image data; and 

7 a second encoder for re-encoding said first encoded image data to said 

8 second encoding format to generate second encoded image data. 

1 25 . A system for processing data including at least one of image data and audio data, 

2 the system comprising: 

3 an analysis module for analyzing data to be encoded and to assign one of a 

4 plurality of encoding levels of complexity to the data to be encoded; 

5 an encoder for generating encoded data from said data to be encoded; 
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6 a file wrapper module for incorporating an encoding complexity level 

7 identifier indicating the encoding complexity level assigned to said data to be encoded 

8 and said encoded data into a single file; and 

9 a data storage device for storing said single file. 

1 26. The system of claim 25, wherein the encoded data is encoded image data, the 

2 system further comprising: 

3 a preview module coupled to said encoder and said analysis module for 

4 displaying a subset of the images represented by encoded data generated by said encoder, 

5 the displayed images being selected for display as a function of encoding complexity 

6 level information associated with said encoded data. 



1 27. The system of claim 20, wherein the preview module selects images represented 

2 by image data assigned a higher than average encoding complexity level for display. 
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